Numerous other elimination techniques will be apparent to those of ordinary skill in 
the art. ~ 



In The Claims: 



Please cancel claims 1-9 without prejudice. Please add new claims 10-32. 



Presentation Of The Claims In A Clean-Unmarked Format 



1^ 



10. (New) A processor comprising: 

a memory to store a first packed data operand having a first plurality of data 
elements ano^a second packed data operand having a second plurality of data 
elements; 

a partial-widthlpacked data instruction to indicate the first packed data operand 
and the second packed data operand and to indicate a first operation to be 
performed on a subset of corresponding pairs of data elements of the first and the 
second packed da|ta operands; 

a decoder coupled With the memory to receive the partial-width packed data 
instruction and to decode the partial-width packed data instruction; and 

a partial-width execution unit coupled with the decoder to execute the operation 
on the subset of corresponding pairs of data elements. 

1 1 . (New) The processor ©f claim 10: 

wherein the decoder is a decoder to convert the partial-width packed data 
instruction into a first mlicfo instruction that corresponds to a first subset of at 
least one corresponding pair of data elements of the first and the second packed 
data operands and a seconc^micro instruction that corresponds to a second subset 
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of at least one correspond/ng pair of data elements of the first and the second 
packed data operands; 

wherein the partial-widtm execution unit is a partial-width execution unit to 
execute an operation specified by the first micro instruction on the first subset. 

12. (New) The processor olf claim 11, fiirther comprising a port to receive at least one 
data element of the firsjt subset and to not receive a data element of the second 
subset. 

1 3 . (New) The processo J of claim 1 1 : 

wherein the processor is a processor to eliminate the second micro instruction; 
and 



wherein the processor is a processor to set at least one result data element 
corresponding to tme second subset to a predetermined value. 

14. (New) The processor of claim 1 1 : 



fiirther comprisin; 
subset; and 



wherein the partif 
execute an operation 
the second subset 



15. (New) The proc( ssor of claim 10: 



fiirther comprisir 
data operand and 
simultaneously 
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delay circuitry to delay execution of operations on the second 



1-width execution unit is a partial-width execution unit to 

specified by the second partial-width micro instruction on 
after the delay. 



g a first port coupled with the memory to receive the first packed 
a second port coupled with the memory to substantially 
receive the second packed data operand; 
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further comprising divide circuitrylto divide the first packed data operand into a 
first subset comprising at least one data element and a second subset comprising 
at least one data element and to divide the second packed data operand into a third 
subset comprising at least one djlta element and a fourth subset comprising at least 
one data element; and 

wherein the partial-width execution unit is a partial-width execution unit to 
perform the first operation ori at least one corresponding pair of data elements of 
the first and the third subset Jto generate at least one resulting data element. 

1 6. (New) The processor of claim 1 5 : 

fiirther comprising delay circuitry to delay the second subset and to delay the 
fourth subset; and 

wherein after the delay, the partial-width execution unit is a partial-width 
execution unit to perforra the first operation on at least one corresponding pair of 
data elements of the secAnd and the fourth subsets to generate at least one 
additional resulting data element. 



17. (New) The processor 
partial-width execution 
element correspondin 
one additional resulting 

18. (New) The processor 



f claim 15, wherein the partial-width execution unit is a 
unit to generate at least one additional resulting data 
to the second and the fourth subsets by setting the at least 
data element to a predetermined value. 



of claim 15, wherein the partial-width execution unit is a 
partial-width execution unit to execute the packed data instruction and a second 
similar packed data i istruction on a half clock cycle. 
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19. (New) The processqr of claim 15, wherein the partial- width execution unit is a 
64-bit partial-width execution unit, and wherein the first and the second packed 
data operands are 128-ait operands. 

20. (New) A method comprismg: 

receiving a packed data instruction that specifies memory locations of a first fiiU- 
width packed data operand having a plurality of data elements and a second fiiU- 
width packed data operand having a corresponding plurality of data elements; 

substantially simultaneously acce\mg the first fixU-width packed data operand 
and the second full-width packed data operand fi'om the memory locations; 

dividing the first full-width packed d^a operand into a first subset of data 



elements and a second subset of data elements and dividing the second full-width 
packed data operand into a third subset o^data elements and a fourth subset of 
data elements; 



performing an operation specified by the packed data instruction on the first and 
third subsets of data elements to generate a first^esulting one or more data 
elements; 

delaying the second and fourth subsets of data elements; 

after said delaying, performing an operation specified By the packed data 
instruction on the second and the fourth subsets of data ^ements to generate a 
second resulting one or more data elements; and 

storing the first and the second resulting data elements in a cJynmon packed data 
operand. 
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22. 



24. 



macro instruction on the 



21 . (New) The method of cl lim 20, wherein performing an operation specified by the 



econd and fourth subsets comprises setting a data 



element to a predetermined value. 

(New) The method of dlaim 20, wherein dividing includes dividing a 128-bit 
packed data operand inljo a 64-bit segment of two low order data elements and a 



64-bit segment of two 



23. (New) A processor cor iprising 



a packed data instructic 
at least one packed dat i 

a decoder to generate 



an execution unit to 
on only a subset of thfe 



igh order data elements. 



n to specify an operation on a plurality of data elements of 
operand; 

first micro instruction and a second micro instruction 



corresponding to the p acked data instruction, the first micro instruction specifying 



a first operation and the 



second micro instruction specifying a second operation; 



execute an operation specified by the first micro instruction 
plurality of packed data elements; and 



circuitry to eliminate ;he second micro instruction. 

(New) The processor of claim 23, wherein the decoder is a decoder to create the 
second micro instruction by replicating the first micro instruction to create a 
repUca and modifying the replica to create the second micro instruction. 
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25. (New) The processor of c|aim 23, wherein the execution unit is an execution unit 
to set a data element in a rpsult packed data operand to a predetermined value. 

26. (New) A method compriang: 

receiving a packed data mstruction specifying memory locations of a first packed 
data operand and a second packed data operand; 

converting the packed Jata instruction into a first packed data micro instruction 
and a second packed data micro instruction; 

executing the first pacied data micro instruction including accessing only a subset 
of data elements of tha first and the second packed data operands comprising at 
least one pair of corresponding data elements from the first and the second packed 
data operands and causing an operation specified by the packed data instruction to 
be performed on the subset to produce a resulting one or more data elements; and 

executing the second packed data micro instruction including accessing only a 
subset of data elemepts of the first and the second packed data operands 
comprising at least Ane pair of corresponding data elements from the first and the 



second packed data 
data instruction to b 



operands and causing an operation specified by the packed 
e separately performed on the subset to produce a resulting one 



or more additional ( ata elements. 



27. (New) The method 
instruction includes 
elements to a predetermined 



)f claim 26, wherein executing the second packed data micro 
setting a data element of the one or more additional data 
value. 
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28. (New) The method of claSm 26, further comprising: 

writing the resulting onejor more data elements to a result packed data 
operand; and 

writing the resulting on^ or more additional data elements to the same result 
packed data operand. 

29. (New) The method of qlaim 26, wherein executing the second packed data 
micro instruction is dejayed relative to executing the first packed data micro 
instruction. 

30. (New) A processor comprising: 



a memory to store a 
operand; 



an instruction to 
data operand and to 
data operand and the 



■St packed data operand and a second packed data 



indicate the first packed data operand and the second packed 
ifidicate an operation to be performed on a the first packed 
second packed data operand; 



decoder means to de ^ode the instruction; and 



execution means to <;xecute the instruction. 



3 1 . (New) The processc r 
means to decode th( 
operation on only 
and a second micro 
portion of the first 
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of claim 30, wherein the decoder means is a decoder 
instruction into a first micro instruction that specifies an 
alportion of the first and the second packed data operands 
instruction that specifies an operation on only a different 
the second packed data operands. 
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32. (New) The processor of claim 30, wherein the execution means is an execution 
means to perform operations specified by the instruction on a first subset of 
corresponding pairs of data elements of the first and the second packed data 
operands and after a aehy to perform operations specified by the instruction 
means on a second sunset of corresponding pairs of data elements of the first and 
the second packed dataloperands. 
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